Cuneiform Detection in Vectorized Raster Images
نویسندگان
چکیده
Documents written in cuneiform script are one of the largest sources about ancient history. The script is written by imprinting wedges (Latin: cunei) into clay tablets and was used for almost four millennia. This three-dimensional script is typically transcribed by hand with ink on paper. These transcriptions are available in large quantities as raster graphics by online sources like the Cuneiform Database Library Initative (CDLI). Within this article we present an approach to extract Scalable Vector Graphics (SVG) in 2D from raster images as we previously did from 3D models. This enlarges our basis of data sets for tasks like word-spotting. In the first step of vectorizing the raster images we extract smooth outlines and a minimal graph representation of sets of wedges, i.e., main components of cuneiform characters. Then we discretize these outlines followed by a Delaunay triangulation to extract skeletons of sets of connected wedges. To separate the sets into single wedges we experimented with different conflict resolution strategies and candidate pruning. A thorough evaluation of our methods and its parameters on real word data shows that the wedges are extracted with a true positive rate of 0.98. At the same time the false positive rate is 0.2, which requires future extension by using statistics about geometric configurations of wedge sets.
منابع مشابه
Multidirectional Scanning Model, MUSCLE, to Vectorize Raster Images with Straight Lines
This paper presents a new model, MUSCLE (Multidirectional Scanning for Line Extraction), for automatic vectorization of raster images with straight lines. The algorithm of the model implements the line thinning and the simple neighborhood methods to perform vectorization. The model allows users to define specified criteria which are crucial for acquiring the vectorization process. In this model...
متن کاملAutomatic Road Extraction from High Resolution Satellite Images Using Neural Networks, Texture Analysis, Fuzzy Clustering and Genetic Algorithms
In this article, a new method for road extraction from high resolution Quick Bird and IKONOS pan-sharpened satellite images is presented. The proposed methodology consists of two separate stages of road detection and road vectorization. Neural networks are applied on high resolution IKONOS and Quick-Bird images for road detection. This paper has endeavoured to optimize neural networks’ function...
متن کاملSelf extracting SVG Rendering of Compressed Raster Images Vectorized by DDT Triangulation
This paper presents a portable compression algorithm applied to raster data images properly triangulated by using DDT (Data Dependent Triangulation). In particular the input source data are encoded to be rendered by standard SVG (Scalable Vector Graphics) engine. The proposed compression strategy works reducing the overall entropy implementing some heuristic strategies to properly re-code the r...
متن کاملCompressed SVG Representation of Raster Images Vectorized by DDT Triangulation
This paper presents a portable compression algorithm applied to raster data images properly triangulated by using DDT (Data Dependent Triangulation). In particular the input source data are encoded to be rendered by standard SVG (Scalable Vector Graphics) engine. The proposed compression strategy works reducing the overall entropy implementing some heuristic strategies to properly re-code the r...
متن کاملThe Usability of Vectorization and a New Point Matching Procedure as First Step in Conflating Raster and Vector Maps
The growing fields of GIS application (local administration, tourism, archaeology, geology, ...) has increased the interest in studying the information sharing from different geographic databases, also known as “GIS data interoperability”. This is a huge problem, involving several aspects. Among them, we decided to concentrate an a geometric conflation of different maps since maps coming from d...
متن کامل